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[57] ABSTRACT 

The method and apparatus includes a neural network for 
generating a model of an object in a wind tunnel from 
performance data on the object The network is trained from 
test input signals (e.g., leading edge flap position, trailing 
edge flap position, angle of attack, and other geometric 
configurations, and power settings) and test output signals 
(e.g., lift, drag, pitching moment, or other performance 
features). In one embodiment, the neural network training 
method employs a modified Levenberg-Marquardt optimi- 
zation technique. The model can be generated “real time” as 
wind tunnel testing proceeds. Once trained, the model is 
used to estimate performance features associated with the 
aircraft given geometric configuration and/or power setting 
input The invention can also be applied in other similar 
static flow modeling applications in aerodynamics, 
hydrodynamics, fluid dynamics, and other such disciplines. 
For example, the static testing of cars, sails, and foils, 
propellers, keels, rudders, turbines, fins, and the like, in a 
wind tunnel, water trough, or other flowing medium. 
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SYSTEM AND METHOD FOR MODELING 
THE FLOW PERFORMANCE FEATURES OF 
AN OBJECT 

ORIGIN OF INVENTION 

The invention herein was made by employees of the 
United States Government and may be manufactured and 
used by or for the Government for governmental purposes 
without payment of any royalties thereon or therefore. 

BACKGROUND OF THE INVENTION 

1. Technical Field of the Invention 

This invention relates generally to the field of modeling 
the flow features of an object and more specifically to a 
nonlinear system and method for modeling the aerodynamic 
performance features of an object in a wind tunnel. 

2. Description of the Prior Art 

During the design and development of new aircraft, wind 
tunnels are used to generate performance data of the aircraft, 
or portions of the aircraft. Given a sufficient amount of 
performance data, a computer model of the aircraft perfor- 
mance can be constructed from the performance data. The 
model is then used to further test and refine the design and 
development of the aircraft. 

From conception to completion, a significant amount of 
time may be invested in wind tunnel testing. Typically many 
mock-up portions of the aircraft are tested (e.g., wings, nose, 
tail, fuselage, or the entire aircraft), and each portion may be 
tested in varying sizes, e.g., 5%, 10%, 20%, 50%, etc. For 
each test, the aerodynamic effects of a plurality of geometric 
configurations and independent variables associated with the 
object being tested are performed. Such independent vari- 
ables include, for example, flap positions (leading and 
trailing edge), slat positions, deflection angles, elevator 
deflections, rudder deflections, stabilizer position, roll, pitch, 
and yaw positions, angle of attack, and permutations of such 
geometric configurations, and power settings such as mach 
number and velocity (collectively referred to as geometric 
configurations). For each independent variable, a plurality of 
dependent variables or physical measurements of the object 
are made using an elaborate “scale” or balance system 
attached to the object under test Such dependent variables 
include, for example, coefficients of lift, drag, side-force, 
pitching moment, yawing moment, and rolling moment (Q,, 
Cp, Cy, C M , Cjv, C b respectively), lift, and other similar 
physical measurements. 

Given the test performance data, a model is developed 
through extensive manual data fitting, analysis, and inter- 
polation to derive values between the actual measurements 
taken. Evaluation of the aircraft design is generally based on 
a visual inspection of the fitted curves produced. Gross-plots 
from different wind tunnel runs are combined to assemble 
the appropriate data to model the aircraft in flight. Most of 
the data is fit by simple polynomials, limited to 1 or 2 
variables taken at a time. If the engineer needs to have curve 
fits for many variables at once, a linear interpolation of the 
data is performed. However, a simple linear interpolation 
may produce errors if there is insufficient data to determine 
accurate minima and maxima. In addition, this post-test 
analysis is very labor intensive, specific to each model and 
test, and requires large amounts of computing power and 
time. 

Modeling of aircraft requires an exceptionally high degree 
of accuracy, mainly for safety reasons. In order to achieve 
sufficient accuracy in modeling the aircraft, over-testing is 
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performed so that potentially important data is not lost. 
Large scale wind tunnel test time is extremely expensive — 
typically about $6,000 per hour. Thus, any decrease in the 
amount of wind tunnel test time would result in a significant 
5 cost reduction in the design and development of the aircraft, 
and a decrease in the design cycle time. 

Once a model is developed, aircraft developers may use 
the model to determine the optimal geometric configuration 
of the aircraft for a particular flight regime. For example, the 
10 geometric configuration of a transport aircraft may be opti- 
mized for lift so as to reduce required wing area. This 
optimization problem is generally very nonlinear. Thus, the 
linear solutions of the prior state of the art are often 
inadequate. In addition, using the prior art methods of 
15 modeling, a model is not readily available during testing. 
The model is created much later in the design process. Thus, 
after the post-test analysis is complete, subsequent wind 
tunnel tests are required to focus on interesting points in the 
aircraft design. This results in increased expense and design 
20 time. 

It would be desirable to provide an efficient, accurate, and 
robust system and method for testing aircraft in wind tunnels 
by which the number of configurations tested is minimized, 
the model provides a good representation of “in between” or 
25 missing configurations, the post-testing analysis to generate 
a model is eliminated or significantly simplified, and the 
generated model reflects inherent nonlinearities. In addition, 
it would be desirable to provide a model in “real time” so 
that wind tunnel testing can be modified “on the fly” to focus 
30 on interesting points (e.g., maximal lift). 

SUMMARY OF THE INVENTION 

A neural network is used to generate a model of the flow 
35 performance features of an object (e.g., an aircraft or por- 
tions thereof) in a wind tunnel from performance data on the 
object. The neural network is specifically constructed and 
adapted to perform with a high degree of accuracy in 
aerodynamic modeling applications. 

40 Initial pairs of test input signals or conditions (e.g., flap 
positions, slat positions, deflection angles, elevator 
deflections, rudder deflections, stabilizer position, roll, pitch, 
and yaw positions, angle of attack, power settings, and 
permutations of such geometric configurations), and test 
45 output conditions (e.g., lift, drag, side-force, pitching 
moment, yawing moment, and rolling moment, and other 
similar physical measurements) are used to train the neural 
network. 

Various learning methods or modules are used in the 
50 present invention. For example, a modified Levenberg- 
Marquardt optimization method, an average generalization 
error, a cost function with a regularization parameter, and a 
mixture of experts paradigms are used in the present inven- 
tion. As wind tunnel testing proceeds, the neural network 
55 learns a complete static aerodynamic model of the object 
When the model attains a sufficient level of accuracy, 
learning is completed. 

Because the model is generated simultaneously with the 
wind tunnel testing procedure, the model can be immedi- 
60 ately used to dynamically adjust the test procedure to focus 
on interesting points. For example, the model can be used to 
determine optimal geometric configurations of maximum 
lift, high alpha performance, or other flight performance 
features. Test procedures can then be immediately imple- 
65 mented that verify the optimal geometric configurations. 
Thus, a separate wind tunnel test at a later date is not 
required. 
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The invention can also be applied in other similar flow 
modeling applications in aerodynamics, hydrodynamics, 
and other such disciplines. For example, the testing of cars, 
sails, and foils, propellers, keels, rudders, turbines, fins, and 
the like in a wind tunnel, water trough, or other flowing 
medium. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a graphical illustration of a conventional neural 
network. 

FIG. 2 is a flow chart of a conventional method for 
evaluating a neural network. 

FIG. 3 is a flow chart of a conventional method for 
training a neural network. 

FIG. 4 is a block diagram of software and hardware 
components for implementing one embodiment of the 
present invention. 

FIG. 5 is a graphical illustration of one embodiment of the 
neural network in the present invention. 

FIG. 6 is a flow chart of the training method of the present 
invention. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Neural Networks Overview 

Referring now to FIG. 1, a neural network 100 comprises 
a collection of cells, also called nodes or neurons 111, 112, 
113, and connections 120 among the nodes. Nodes are 
generally classified into three types or layers: input 111, 
output 112, and hidden 113. Nodes in the input layer 111 
receive input signals 115 from sources external to the 
network. Nodes in the output layer 112 send output signals 
117 from the network but, in practice, and as seen in FIG. 1, 35 
the output signals 117 are sent by hidden nodes 113. Within 
the network, hidden nodes 113 are connected to input nodes 
111, output nodes 112 and/or other hidden nodes 113 (e.g., 
in a multi-layer network). In the example shown in FIG. 1, 
the network is “fully interconnected” in that each node in 
one layer is connected to every other node in the succeeding 
layers. A network need not be fully interconnected. 

Each connection 120 between two nodes has an associ- 
ated value, called a weight 122. Weights 122 may be either 
positive or negative. In one embodiment weights 122 are 
normalized to values between -1 and 1. 

Referring additionally now to FIG. 2, a high level flow- 
chart of the operation of a neural network is shown, wherein 
functions are indicated by reference numbers which are 
sometimes referred to herein without stating the associated 
nomenclature shown in the related box thereof. Each node 
calculates a “state” value based on output signals (positive 
or negative) from nodes in the layer below, and the associ- 
ated weight 122 of the connection 210. The calculation of 
the state of the node is governed by an associated activation 
function. A typical activation function is the summation of 
the products of each output signal, x t -, multiplied by the 
associated weight value, w f , 122 expressed as: 

( 1 ) 

If a threshold state is achieved (see block 212), the node 
will “fire” an output signal to connected nodes in the next 
layer above (the threshold may also be 0). Otherwise no 
output from the node is made (see block 214). An output 
function associated with the node maps the state of the node 
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216 to an output signal 218. Typical output functions are 
linear, linear threshold, step, or a sigmoid function. 

The nodes in the next layer behave in the same fashion — 
i.e., calculate the node state based on input signal and 
5 weighted connections, and “fire” an output signal if a 
threshold is achieved. 

Referring now to FIG. 3, the neural network 100 is 
“trained” by presenting test input signals 310 to each of the 
input nodes and calculating an output signal according to the 
10 network indicated in block 312 (see FIGS. 1, 2). The 
calculated output signal is compared (see block 314) to a 
known or measured test output signal 316. The set of test 
input signals 310 and test output signals 316 together 
comprise a training set Based on the comparison step 314, 
15 a learning function 318 makes internal adjustments to the 
weights associated with the connections (see block 320) in 
a conventional manner to generate modified weights (see 
block 321). Typical learning functions 318 include the Back 
Propagation Learning rule, Modified Delta Rule, Kohonen 
20 Self Organization, and the like. 

A stopping criterion is checked to determine if further 
training is required (see block 322). Typical stopping criteria 
are based on a measurement of the error between the 
calculated output and the test output signal, or if the number 
25 of iterations exceeds a certain number. If a stopping criterion 
is satisfied, training is complete (see block 324), otherwise 
another test input value is passed to the network and the 
method is repeated. After training, the network can be used 
to respond to random input signals (i.e., input for which 
30 there is no corresponding desired output supplied), and can 
be used for further modeling analysis. 

Neural networks can be implemented in computer hard- 
ware or simulated in software. Large collections of nodes 
and connections are conventionally represented by matrices. 
In one embodiment, the present invention is implemented 
using the hardware and software components shown in FIG. 
4. Processor 402 is a conventional engineering workstation 
or other computer processor such as an Intel 80x86 or 
Pentium central processing unit (CPU), Motorola 680x0 
40 CPU, RISC CPU, and the like. Processor 402 may also be 
coupled to other processors accessible over conventional 
communications channels or buses (not shown). Processor 
402 is conventionally coupled to storage 404 which may be 
a magnetic disk storage, a CD storage unit, or other con- 
45 ventional computer data storage unit. Storage 404 may also 
be coupled to other storage units accessible over conven- 
tional communications channels or buses (not shown). 

Processor 402 is also conventionally coupled to memory 
408 which is a random access memory (RAM) unit or other 
50 conventional computer memory. Items in memory 408 may 
alternatively be stored in storage 404 and accessed by 
processor 402 when required. Memory 408 comprises 
weight matrix 410 for storing the structure of the neural 
network nodes, connections, and weight values associated 
55 with the connections. Memory 408 also comprises neural 
network module 412 for defining the operation of the neural 
network 100, for example, activation functions and output 
functions, executable on processor 402, that define the 
operation of the nodes 111, 112, 113 and connections 120. 
60 Memory 408 also comprises tr ainin g data 414 comprising 
test input signals and corresponding test output signals for 
training the neural network. 

Input means comprises conventional input devices such as 
a keyboard, mouse, track-ball, touchscreen, or other similar 
65 electronic input means 430. A conventional display unit, 
output cable, oscilloscope, printer, or other electronic signal 
output means may also be coupled to processor 402. 
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The preferred embodiment of the present invention may 
be implemented on any platform, operating system, and user 
interface such as: IBM PCs or Compatibles running 
Microsoft Windows; Sun Microsystems workstations run- 
ning Sun OS-SunView; or, DEC VAX running VMS, and the 5 
like. One embodiment uses MATLAB (as a programming 
toolbox) on a Silicon Graphics Indigo EXtream II computer. 
Such systems may be operated in a distributed data/ 
distributed processor environment, if desired 
Wind Tunnel Testing Overview 

Wind tunnels are used to measure a number of perfor- 
mance features of aircraft designs. Typically a scale model 
of a proposed aircraft is subjected to wind tunnel testing, for 
example a 50% scale model of the NASA SHARC research 
aircraft (Super High Alpha Research Concept). Separate 
sections of the aircraft, such as a wing, tail, fuselage, may 15 
also be tested independently. Similar static testing is used in 
other flow modeling applications in aerodynamics, 
hydrodynamics, fluid dynamics, and other such disciplines 
(collectively referred to as flow dynamics). For example, the 
static testing of cars, sails, foils, propellers, keels, rudders, 20 
turbines, fins, and the like in a wind tunnel, water trough, or 
other flowing medium. 

Performance features, or any static dependent variable, of 
the object under test include, for example, lift, drag, lift/drag 
ratio, side-force, pitching moment, yawing moment, rolling 25 
moment, coefficient of momentum, center of mass, maxi- 
mum lift, high alpha performance, and other similar static 
physical measurements. These features are measured via a 
“scale” or balance system attached to the object in the wind 
tunnel. 

The performance features vary as a function of a plurality 
of geometric configurations, or any independent variables, 
associated with the object. Such independent variables may 
include, for example, flap positions, slat positions, deflection 
angles, elevator deflections, rudder deflections, stabilizer 35 
position, roll, pitch, and yaw positions, angle of attack, and 
permutations of such geometric configurations, power set- 
tings such as mach number and velocity, or other indepen- 
dent variables. 

Flow Modeling Neural Network Architecture 

According to the present invention, a model of an object 
in a wind tunnel is generated from a neural network having 
input nodes corresponding to a plurality of geometric con- 
figurations of an object in a wind tunnel, and output nodes 
corresponding to one or more performance features of the 45 
object 

Referring now to FIG. 5, one embodiment of neural 
network 500 is shown. The three input nodes 521, 522, 523 
correspond to the geometric configurations of, for example, 
angle of attack (a), leading edge flap position (LE) and 50 
trailing edge flap position (TE). In one embodiment, LE and 
TB are each tested in six distinct positions (thus there are 36 
possible LE and TE flap combinations), and a is varied in 16 
positions for each of the 36 flap settings. In one embodiment, 
all input signals are scaled to variance 1 (e.g., using a 55 
conventional sigmoid function). 

One output node 531 corresponds to a performance fea- 
ture associated with the object, for example the coefficient of 
drag (Q>), coefficient of lift (Q), pitching moment (C M ), or 
Lift to Drag ratio (L/D). A plurality of performance features 60 
can be simultaneously predicted from a plurality of input 
nodes associated with a plurality of geometric configuration 
nodes in one network. However, for clarity and simplicity, 
one embodiment of the present invention predicts or esti- 
mates each performance feature via a separate neural net- 65 
work with one output node. The output node is governed by 
a linear output function of the form 


F/x)=x. (2) 

A conventional sum of products of the weight and the input 
is used for an activation function (Iw*x t ). In one 
embodiment, all output signals are scaled to variance 1. 

In this embodiment, 15 hidden nodes, generally indicated 
at 541-555, are each governed by a tangent hyperbolic 
activation function of the form 

f/x)=taiih(x). (3) 

The size of the number of data measurements or data 
points in the data set imposes an upper limit on how many 
weights the network should contain. In general, the number 
of data points must exceed the number of weights to ensure 
that generalization is achieved. 

Training 

The training step determines the weight values 122 in 
network 500 from a set of test measurement data. Measure- 
ment data comprises historical or known test inputs signals 
310 and corresponding test output signals 316 (see FIG. 3). 
The set of paired test input and test output signals is referred 
to as the training set In the present invention, a training set 
may comprise, for example, historical or recorded data on 
geometric configuration input signals 521, 522, 523 and 
corresponding performance feature output signal 531 asso- 
ciated with an object in a wind tunnel. 

During training, various combinations of test geometric 
30 configuration input signals 521, 522, 523 are iteratively 
inputted to the neural network, 500, and a performance 
feature output signal 531 is measured. According to an 
iterative learning method, adjustments are made to the 
weight connections as a function of the measured perfor- 
mance feature output signal and the known test performance 
feature output signal 531 corresponding to the input signals 
521, 522, 523. 

A performance feature, y (e.g., drag, lift, L/D, etc.) output 
signal 531, varies as a function, g, of a plurality of geometric 
40 configurations, <j> e.g., angle of attack (a), leading edge flap 
angle (LE), and trailing edge flap angle (TE) 521, 522, 523. 
This is expressed as: 

y=&(4>) (4) 

Measurement errors may also be reflected in one additive 
noise term, e 

y=go(4>)+e (5) 

The function g is unknown. However, the training set, Z N , 
provides a set of known (y, <|>) pair measurements, expressed 
as: 

z N ={[4>V];i=i^N} (6) 

Training set, Z N , is used to train neural network 500 to 
predict the performance feature, y, from 

y-g'(4>) (7) 

The predictor is found from the training set, Z N . 
Expressed in mathematical terms, the network 500 will find 
a mapping from the set of measurements, Z N , to the set of 
functions contained in the chosen network architecture 

g(m 
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z"->©' (8) 

so that value of y’ is close to the true value of y, where 0 is 
the parameter vector of the network (i.e., a vector containing 
all adjustable parameters of the network in a weight matrix). 
The mean square error cost function is used to determine the 
quality of the predictor (e.g., how “close” y' is to y), and is 
expressed as: 

1 N 1 N (9) 

w )=-%r £ l =-hr l x (£, ‘ (e))2 

In another embodiment, a relative absolute error cost 
function is used. 

In mathematical terms, training of network 500 is an 
unconstrained optimization or minimization of the error cost 
function. This optimization problem can be solved using 
conventional nonlinear training methods, for example, back- 
propagation, Baysian, Fuzzy Set, Non Linear regression, or 
other neural network paradigms including mixture of 
experts, CMACS, Radial Basis Functions, directed search 
networks, and functional link nets. 

In one embodiment of the present invention, an adaptation 
of a Levenberg-Marquardt method is used for solving the 
optimization problem (see Mathworks 1993 (“Optimization 
Toolbox for MATLAB,” published by The Math Works Inc.) 
for application in a conventional statistical optimization 
problem) of the error cost function. The Levenberg- 
Marquardt method has several advantages over conventional 
neural network training methods: speed, guaranteed conver- 
gence to a (local) minima, and numerical robustness. The 
Levenberg-Marquardt method is especially well-suited to 
the present invention where the data set is small — e.g., on 
the order of hundreds of data measurements in the tr ainin g 
set. In cases where the data set is large, the Levenberg- 
Marquardt method may be less suitable. In such larger 
applications, a mixture of experts technique generally gives 
good results, scales up well, and can handle non-continuous 
functional regions. 

The Levenberg-Marquardt method exists in numerous 
versions. The simplest version may be found in the original 
contribution of Marquardt (1963) known in the art. The 
version adapted in one embodiment of the invention belongs 
to a class called Trust Region methods discussed in the 
technical article “Practical Methods of Optimization ,” pub- 
lished by Wiley and Sons 1987. The Levenberg-Marquardt 
method used in the present invention is an iterative search 
scheme defined by 

( 10 ) 

From the current iterate 0 ( * } , a new iterate is found by taking 
a step of size \i (k) in the direction h (fc) . Choosing a step size, 
p ( * } , of 1, simplifies the problem. 

FIG. 6 is a flow chart of the training method used in one 
embodiment of the present invention. A search direction, h (jt) 
(a vector), is found by initializing a parameter vector 0 (O) 
(i.e., the weight matrix) (see block 601) and selecting an 
initial random prediction value A (0) (see block 603). 
Alternatively, A (0) may be chosen from a Gaussian distribu- 
tion. The initial parameter vector, 0, may be initialized, for 
example, with random numbers between -1 and 1. In 
another embodiment, the parameter vector is initialized with 
a random Gaussian distribution from -1 to 1. The training 
step can be run several times starting with different initial 
parameter vectors. The set of trained weights leading to the 
lowest minima is then chosen. 
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The search direction, h (k \ is determined (see block 605) 
by solving 




(ii) 


where R represents the Gauss-Newton approximation to the 
Hessian (V"(0), and the error metric V" is a vector of second 
partial derivatives of the Hessian matrix) and is defined by 
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m=jj J f^ex^e)) 7 


( 12 ) 


where \|/*(0) denotes the gradient of the network 500 output 
signal 531 with respect to each of the weights. When the 
15 input is \|/ r 


V(9) = ^- 


(13) 


G(0) in equation 11 is the gradient of the mean square 
2° error cost function (eqn. 9), and is expressed as 


Gf(0®)sV(e®) = J^L 


(14) 


25 


30 


The gradient of the mean square error cost function can be 


determined by 

^=17 *‘W = -TT If £, '< e >= 


(15) 


~7T I Vme) 


The network is evaluated, and V(0 w +h ( * ) ) is determined 
(see block 607) as well as a “prediction” (see block 609) of 
the loss measured by V(0 w )-L(0 (A:) -lli (A:) ). Where L(0*) is an 
estimate of the loss from the Gauss-Newton algorithm, 
V(0)* is from the mean square error cost function, and h is 
the search increment. 

Next, an adjustment in the prediction value, A ( * } , is 
determined (see block 611) as follows: 


If V(0 ( Vk(0 (i V^)>O75[V(0<*VW <A) +^ ) ^ then X<* ) =X ( *V2(16) 
If V(0W)-V(0^+/i^)<O.25[V(e^)-L(e^>+/2(* ) ) then X<*>=U<*>.(17) 

45 * 

For all other values, the initial A is retained and only h is 

changed. 

In the above equations, 

so [V(e(*))-L(0(*> + #*>)] =-L ^ (h&pcK&to) + Mb 2 ) 1 

TfV N (Q (k) +h (k) )<VJQW) (see block 613) then accept 0 ( * +1) 
=0 (: * ) +h ( * ) as a new iterate, and let A5* +1) =A- c * ) 615. 
Otherwise, set k=k+l (see block 620) and go to the deter- 
55 mination of h (eqn. 11 above) and indicated in block 605. 
If a stopping criterion (see block 614) is not satisfied then 
set k=k+l and repeat the method indicated in block 615 by 
determining a search direction, h. If a stopping criterion is 
satisfied indicated in block 617, set and terminate. A 
60 typical stopping criterion is, for example, where the gradient 
of the mean square error cost function gets sufficiently close 
to zero, the mean square error cost function gets below a 
certain value, or the number of iterations exceeds a certain 
value. One or more stopping criterion may be used. 

65 If the prediction value A, is large, the diagonal matrix will 
overwhelm the Hessian matric (R) in equation 11. Thus, in 
the present invention, the search direction, h, approaches the 
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gradient direction, but with a step size close to 0 (similar to 
the back-propagation method). This is an important property 
to ensure convergence, since the mean square error cost 
function always may be minimized by t akin g small enough 
steps in the search direction, h, of the gradient. If k=0 the 
above method coincides with the Gauss-Newton method. 
Instead, in the present invention, X, is decreased if a new 
iterate leads to a decrease in mean square cost function close 
to what is predicted using L(0) ((see block 609 ), equation 
16). Otherwise, A, is increased ( 609 , equation 17). 

Further Modifications 

In another embodiment of the present invention, a differ- 
ent cost function than the conventional mean square error 
function (equation 9) is used. Because the measurements in 
the wind tunnel are corrupted by noise, a better measure of 
the fit is the mean square error on “all possible” measure- 
ments. This measure is known as the generalization error — 
the ability to predict new measurements, not seen during the 
training phase — and is expressed as 


V(§)=E{V(§)} (19) 

However, this value is difficult to compute in practice, and 
does not give information about whether the selected net- 
work architecture is a good choice. 0 depends on the training 
set and is thereby a stochastic variable, which in turn means 
that V(0) is a stochastic variable. Taking the expectation of 
V(0) with respect to all possible training sets of size N, 
yields 


J(M^E{V(9)} (20) 

which is called the average generalization error or the model 
quality measure (see technical article “System Identification 
Theory for the User” of L. Ljung, Prentice Hall 1987). 

Assuming the existence of set of “true” weights in 0 O that 
exactly describe the data-generating function by the chosen 
network architecture (g(<t>;0 o )=go(<t>)X and assuming the 
noise contribution is white noise independent of the inputs, 
an estimate of the average generalization error can be made. 
This estimate is known as Akaike’s final prediction error 
(FPE) estimate (see the previously mentioned Technical 
Article of L. Ljung for a derivation thereof) and is expressed 
as 


/(M) = 


N+J m 

N-d M 


m 


( 21 ) 


where d M is the total number of weights in the network, and 
N is the size of the training set. 

According to eqn 19, as the size of the network is 
increased, the flexibility of the network is increased and a 
better fit can be made for the training data. Thus, V(0) is a 
decreasing function of the number of weights in the network. 
If too much flexibility is added, noise in the data will also be 
captured with the properties of the tr ainin g set in the 
network, and/or the data in the training set will merely be 
memorized and no generalization will occur in the tr ainin g 
process. This is known as overfitting. A conventional solu- 
tion to overfitting is to find the “optimal” network architec- 
ture (an architecture that minimizes J), for example, by 
“pruning” the network (see the technical article “Optimal 
Brain Surgeon and General Network Pruning,” of Hassibi et 
al., published in Proc. of the 1993 IEEE Int Conference of 
Neural Networks, San Francisco, pp. 293-299). Other solu- 
tions to overfitting include, for example, adding an extension 
to the cost function, called regularization (or weight decay) 


(see, for example, the technical article “Adaptive Regular- 
ization ” of Hansen et al., published in the Proc. of the 1994 
IEEE Neural Network in Signal Processing Workshop, 
Greece, and the technical article “Generalization Perfor- 
5 mance of Regularized Neural Network Models ” published 
in the Proc. of the 1994 IEEE Neural Networks in Signal 
Processing Workshop, Greece) 

Applications of the Model 

Once the model is generated, it can be used to determine, 
10 predict or estimate various performance features of the 
aircraft, given an input to the neural network. For example, 
given a plurality of inputs (e.g., flap positions, slat positions, 
deflection angles, elevator deflections, rudder deflections, 
stabilizer position, roll, pitch, and yaw positions, angle of 
attack, and permutations of such geometric configurations, 
15 and power settings such as math number and velocity) the 
neural network can predict performance characteristics such 
as coefficients of lift, drag, side-force, pitching moment, 
yawing moment, and rolling moment (C L , C D , C y , C M , C N , 
C b respectively), and lift. 

20 Conversely, the model can also be used to determine the 
optimal setting of certain geometric configurations and 
power setting. For example, the model can be used to 
determine the optimal flap settings at each angle of attack 
(a) that maximizes lift to drag ratio (L/D). A network having 
25 L/D as an output node, and a, leading edge flap (LE), and 
trailing edge flap (TE) as input nodes may be constructed 
(see FIG. 5). Then, for a given a, LE and TE are varied. The 
optimal output L/D is recorded for each setting of a, and the 
corresponding LE and TE noted. 

30 Using the present invention, approximately 40 to 70 
percent less wind tunnel data is required to derive a model 
of sufficient accuracy to analyze the aircraft performance. 
Thus, less wind tunnel test time is required, and design and 
development costs are significantly reduced. Using the 
35 present invention generates a highly accurate model almost 
equal to the noise level in the original measurement data 
(i.e., the error factor in measuring the performance features 
of an object in the wind tunnel). The model is generated and 
used in seconds, and can be run on desktop PC’s. Conven- 
40 tional numerical aerodynamic modeling takes days or weeks 
and commonly requires a super-computing computer or 
equivalent 
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What is claimed is: 

1. A method for generating a model of one or more flow 
performance features of an object in a flowing medium 
having a plurality of geometric configurations, the method 
comprising the steps of: 

(a) forming a neural network comprising: 
a plurality of input nodes associated with the geometric 

configurations of an object, 
a plurality of hidden nodes, 

one or more output nodes associated with the flow 
performance features of the object, and 
a weight matrix for defining connections between the 
input nodes and the hidden nodes, connections 
between the hidden nodes and the output nodes, and 
weight values associated with each connection, 

(b) forming a test database, coupled to the neural network, 
comprising a plurality of test input signals and corre- 
sponding test output signals, the test input signals 
associated with the geometric configurations of the 
object and the test output signals associated with the 
flow performance features of the object, the test input 
signals associated with the input nodes and the test 
output signals associated with the output nodes, said 
weight values being modified as a function of said test 
input signals; 

(c) solving for a search direction to obtain one of said flow 
performance features comprising the steps of: 

(i) initializing a parameter vector representative of a 
weight value of said one of said flow performance 
features; 

(ii) selecting a random predictor value for said one of 
said flow performance features and having an error 
cost function with a gradient; and 

(iii) iterating the parameter vector and random predic- 
tor value to determine said search direction, said 
determination being satisfied when qualifying one of 
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the conditions selected from the group comprising: 

( 1 ) the gradient of the error cost function is near zero, 

( 2 ) the error cost function is below a predetermined 
value, and (3) a predetermined number of iterating of 
the parameter vector and random predictor value has 
been accomplished. 

2. The method according to claim 1, wherein said error 
cost function is a mean square error cost function. 

3. The method according to claim 2, wherein said mean 
io square error cost function is based on all of the parameter 

vectors and random predictor values used for said iterating 
step. 

4. The method according to claim 3, wherein said mean 
square error cost function is determined by an average 

!5 generalization error technique. 

5. The method according to claim 4, wherein said average 
generalization is established by a Akaike final prediction 
error (FPE) estimate. 

6 . The method according to claim 1, wherein said error 
20 cost function is a relative absolute error function. 

7. The method according to claim 1, wherein said output 
nodes is selected to be of a number which exceeds the 
number of selected weight values. 

8 . The method according to claim 1, wherein the geomet- 
25 ric configurations are selected from the group comprising 

leading edge flap, trailing edge flap, angle of attack, flap 
positions, slat positions, deflection angles, elevator 
deflections, rudder deflections, stabilizer position, roll, pitch, 
and yaw positions, angle of attack, mach number, and 
30 velocity. 

9. The method according to claim 1, wherein the hidden 
nodes produce a tangent hyperbolic activation function 
signal. 

10. The method according to claim 1, wherein the output 
35 nodes produce linear output function signals. 

11. The method according to claim 1, wherein the flow 
performance features are selected from the group compris- 
ing of lift, drag, lift/drag ratio, side-force, pitching moment, 
yawing moment, rolling moment, and maximum lift. 

12. The method according to claim 1, wherein a set of 
paired test input signals and corresponding test output 
signals are generated from wind tunnel testing of the object 

13. The method according to claim 1, wherein said 
iterating step is accomplished by a Levenberg-Marquard 

45 technique. 

14. The method according to claim 1, wherein said 
iterating step is accomplished by a back-propagation tech- 
nique. 

15. The method according to claim 1, wherein said 
50 random predictor value is chosen from a Gaussian distribu- 
tion. 





